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(54) Title: NUCLEIC ACID INlTiGRATION IN EUKARYOTliS 

(57) Abstract: The invention relate to the field of molecular biology and cell biology. It particularly relates to methods to direct 
integration oTa nucleic acid of interest towards homologous recombination and uses thereof. The present invention discloses factors 
involved in iniegrdlion of a nucleic acid by illegitimate recombination which provides a method to direct integration of a nucleic 
acid of interest to a pre -determined site, whereby said nucleic acid has homology at or around the said pre-dclermined site, in a 
eukaryoie with a preference for non-homologous recombination comprising steering an integration pathway towards homologous 
recombination. I'urthcrmorc, the invention provides a method to direct integration of a nucleic acid of interest to a sub-tclomcric 
and/or tclom eric region in a eukaryotc with a preference for non-homologous recombination. 
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Title: Nucleic acid integration in eukaryotes 

The invention relates to the field of molecular biology and cell biology. It 
particularly relates to methods to direct integration towards homologous 
5 recombination and uses thereof. Several methods are known to transfer nucleic 
acids to, in particular, eukaryotic cells. In some methods the nucleic acid of 
interest is transferred to the cytoplasm of the cell, in some the nucleic acid of 
interest is integrated into the genome of the host. Many different vehicles for 
transfer of the nucleic acid are known. For different kinds of cells, different 

10 systems can be used, although many systems are more widely applicable than 
just a certain kind of cells. In plants, e.g., a system based onAgrobacterium 
tumefaciens is often apphed. This system is one of the systems that is used in a 
method according to the invention. 

The soil bacterium Agrobacterium tumefaciens is able to transfer part of 

15 its tumor-inducing (Ti) plasmid, the transferred (T-) DNA, to plant cells. This 
resxilts in crown gall tumor formation on plants due to expression of oiic-genes, 
which are present on the T-DNA. Virulence (vir) genes, located elsewhere on 
the Ti-plasmid, mediate T-DNA transfer to the plant cell. Some Vir proteins 
accompany the T-DNA during its transfer to the plant cell to protect the T- 

20 DNA and to mediate its transfer to the plant nucleus. Once in the plant 

nucleus, the T-DNA is integrated at a random position into the plant genome 
(reviewed by (Hooykaas and Beijersbergen, 1994), (Hansen and Chilton 1999). 
Removal of the on.c-genes from the T-DNA does not inactivate T-DNA transfer. 
T-DNA, disarmed in this way, is now the preferred vector for the genetic 

25 modification of plants. 

Although much is known about the transformation process, not much is 
known about the process by which the T-DNA is integrated into the plant 
genome. It is likely that plant enzymes mediate this step of the transformation 
process (Bundock at al. 1995). The integration pattern of T-DNA in 

30 transformed plants has been extensively studied (Matsumoto at al. 1990) 
(Gheysen et al 1991) (Meyerhofer et al 1991). The results indicated that T- 
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DNA integrates via illegitimate recombination (IR) (also called non- 
homologous recombination, both terms may be used interchangeable herein), a 
process which can join two DNA molecules that share little or no homology 
(here the T-DNA and plant target DNA). Even T-DNA molecules in which a 
5 large segment of homologous plant DNA was present, integrated mainly by~IR 
and only with very low frequency (IrlO^-lO^) by homologous recombination 
(HR) (Offi-inga et al. 1990). 

Recently, it was shown that Agrobacterium is not only able to transfer 
-^^itsJT-DNA to plant cells, but also to other eukaryotes, including the yeast 

10 S.cerevisiae (Bundock et al. 1995) and a wide variety of filamentous fungi (de 
Groot et al. 1998). In S.cerevisiae, T-DNA carrying homology with the yeast 
genome integrates via HR (Bundock et al. 1995). However, T-DNA lacking any 
homology with the S.cerevisiae genome becomes integrated at random 

\^ positions in the genome by the same IR process as is used in plants (Bimdock 

15 and Hooykaas 1996). Apparently, eukaryotic cells have at least two separate 
pathways (one via homologous and one via non-homologous recombination) 
through which nucleic acids (in particular of course DNA), can be integrated 
into the host genome. The site of integration into a host cell genome is 
important with respect to the likelihood of transcription and/or expression of 

20 the integrated nucleic acid. The present invention provides methods and 
means to direct nucleic acid integration to a pre-determined site through 
steering integration towards the homologous recombination pathway. The 
present invention arrives at such steering either by enhancing the HR 
pathway, or by inhibiting (meaning reducing) the IR pathway. 

25 Host factors involved in the integration of nucleic acid by IR have so far 

^J?* J^^®^ identified. The present invention discloses such factors which enables 
the design of methods for their (temporary) inhibition, so that integration of 
nucleic acid by IR is prevented or more preferably completely inhibited, 
shifting the integration process towards HR and facilitating the isolation of a 

30 host cell with nucleic acid integrated by HR at a predetermined site. This is 
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extremely important, since there is no method available yet for easy and 
precise genetic modification of a host cell using HR (gene targeting). Of course 
the actual site of integration is then determined by homology of the nucleic 
acid of interest with said site. 
5 In a first embodiment the invention provides a method to direct nucleic 

acid integration of a nucleic acid of interest to a pre-determined site, whereby 
said nucleic acid has homology at or around the said pre-determined site, in a 
eukaryote with a preference for non-homologous recombination comprising 
steering an integration pathway towards homologous recombination. 

10 Preferably, such a method comprises at least the steps of introducing said 

nucleic acid of interest to a cell of said eukaryote, for example by the process of 
transformation or electroporation, and integration of said nucleic acid in the 
genetic material of said celL Integration is a complex process wherein a nucleic 
acid sequence becomes part of the genetic material of a host cell. One step in~^^ 

15 the process of nucleic acid integration is recombination; via recombination 
nucleic acid sequences are exchanged or inserted and the introduced nucleic 
acid becomes part of the genetic material of a host cell. In principle two 
different ways of recombination are possible: homologous and illegitimate or 
non-homologous recombination. Most (higher) eiikaryotes do not or at least not 

20 significantly practise homologous recombination although the essential 
proteins to accomplish such a process are available. One reason for this 
phenomenon is that firequent use of homologous recombination in (higher) 
eukaryotes could lead to undesirable chromosomal rearrangements due to the 
presence of repetitive nucleic acid sequences. fTo accompKsh homologous 

25 recombination via a method according to the invention, it is important to 
provide a nucleic acid which has homology with a pre-determined site. It is 
clear to a person skilled in the art that the percentage of homology and the 
length of (a) homologous region(s) play(s) an important role in the process of 
homologous recombination. The percentage of homology is preferably close to 

30 100%. A person skilled in the art is aware of the fact that lower percentage of 
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homology are also used in the field of homologous recomhination, but 
dependent on, for example, the regions of homology and their overall 
distribution, can lead to a lower efBciency of homologous recombination but 
are still useftd and therefore included in the present invention. Fxirthermore, 
5 the length of a (nearly) homologous region is approximately 3 kb which is ^ - 
sufficient to direct homologous recombination. At least one homologous re^on 
is necessary for recombination but more preferably 2 homologous regions 
flanking the nucleic acid of interest are used for targeted integration. The 
researcher skilled in the art knows how to select the proper percentage of 

10 homology, the length of homology and the amount of homologous regions. By 
providing such a homology a nucleic acid is integrated at every desired 
position within the genetic material of a host cell. It is clear to a person skilled 
in the art that the invention as disclosed herein is used to direct any nucleic 
acid (preferably DNA) to any pre-determined site as long as the length of 

15 homology and percentage of homology are high enough to provide/enable 

homologous recombination. A pre-determined site is herein defined as a site 
within the genetic material contained by a host cell to which a nucleic acid 
with homology to this same site is integrated with a method according to the 
invention. It was not until the present invention that a nucleic acid is 

20 integrated at every desired position and therefore a method according to the 
invention is applied, for example, to affect the gene function in various ways, 
not only for complete inactivation but also to mediate changes in the 
expression level or in the regulation of expression, changes in protein activity 
or the subcellular targeting of an encoded protein. Complete inactivation, 

25 which can usually not be accompUshed by existing methods such as antisense 
technology or RNAi technology (Zrenner et al, 1993), is useful for instance for 
the inactivation of genes controlling undesired side branches of metabolic 
pathways, for instance to increase the quahty of bxilk products such as starch, 
or to increase the production of specific secondary metabohtes or to inhibit 

30 formation of unwanted metabolites. A method according to the invention is 
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also used to inactivate genes controlling senescence in fniits and flowers or 
that determine flower pigments. Replacement of existing regulatory sequences 
by alternative regulatory sequences is used to alter expression of in situ 
modified genes to meet requirements, (e.g. expression in response to particular 
5 physical conditions such as Ught, drought or pathogen infection, or in response 
to chemical inducers), or depending on the developmental state (e.g. in a 
storage organ, or in fruits or seeds) or on tissue or cell types. Also a method 
according to the invention is used to effectuate predictable expression of 
transgenes encoding novel products, for example by replacing existing coding 
10 sequences of genes giving a desired expression profile by those for a desired 

novel product. For example to produce proteins of medicinal or industrial value 
in the seeds of plants the coding sequence of a strongly expressed storage 
protein may be replaced by that of the desired protein. As another example 
existing coding sequences are modified so that the protein encoded has 
15 optimized characteristics for instance to make a plant herbicide tolerant, to 
produce storage proteins with enhanced nutritional value, or to target a 
protein of interest to an organelle or to secrete it to the extraceUular space. As 
yet another example eukaryotic cells (including yeast, fungus, plant, 
mammahan cells or (non-human) animal cells) are provided with a gene 
20 encoding a protein of interest integrated into the genome at a site which 

ensures high expression levels. As another example the nucleic acid of interest 
can be part of a gene delivery vehicle to deHver a gene of interest to a 
eukaryotic cell iri vitro or in vivo. In this way a defect p53 can be replaced by 
an intact p53. In this way a tumoricidal gene is delivered to a pre-determined 
25 site present only in e.g. proUferating cells, or present only in tumor cells, e.g. 
to the site where a tumor antigen is expressed from. Gene dehvery vehicles are 
well known in the art and include adenoviral vehicles, retroviral vehicles, non- 
viral vehicles such as liposomes, etc. As another example the invention is used 
to produce transgenic organisms. Knock-out transgenics are already produced 
30 by homologous recombination methods. The present invention improves the 
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efficiency of such methods. Also transgenics with desired properties are made. 
It is clear to a person skilled in the art that transgenics can for example be 
made by the use of Agrobacterium as gene deUvery vehicle for plant (Verg\mst 
et al., 1998), yeast (Bundock et al., 1995), fungus (de Groot et al., 1998) or 
5 animal (Kunik et al., 2001) or by direct DNA delivery methods exemplifiedl)y 
but not restricted to electroporation for yeast (Gietz & Woods, 2001), plant* 
(D'Halluin et al., 1992; Lin et al., 1997), fungus (Ozeki et al, 1994) and animal 
(Templeton et al., 1997), LiCl treatment for yeast (Schiestl et al., 1993), micro- 
injection for plant (Schnorf et al., 1991) and animal (Capecchi, 1980) and 

10 "DNA whiskers" for plant (Kaeppler et al., 1990; Dxmwell, 1999) or particle 
bombardment for plants and animals (Klein et al., 1992). It is furthermore 
clear that transgenic plants can be obtained via selective regeneration of 
transformed plant cells into a complete fertile plant (Vergunst et al., 1998) or 
via non-regenerative approaches by transforming germ line cells, exemplified 

15 by but not restricted to dipping Arabidopsis flowers into an Agrobacterium 

suspension (Bechtold et al., 1993). It is also clear that transgenic animals can 
be obtained by transforming embryonic stem cells with one of the DNA 
delivery methods mentioned above (Hooper, 1992). 

In another embodiment the invention provides a method to direct 

20 nucleic acid integration to a pre-determined site, whereby said nucleic acid has 
homology at or aroimd the said pre-determined site, in a eukaryote with a 
preference for non-homologous recombination comprising steering an 
integration pathway towards homologous recombination by providing a 
mutant of a component involved in non-homologous recombination. Methods to 

25 identify components involved in non-homologous recombination are outlined in 
the present description wherein S,cerevisiae was used as a model system. To 
this end several yeast derivatives defective for genes known to be involved in 
various recombination processes were constructed and the effect of the 
mutations on T-DNA integration by either HR or IR was tested. The results as 

30 disclosed herein show that the proteins encoded by YKU70, RAD 50, MREll, 
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XRS2, LIG4 and SIR4 play an essential role in DNA integration by IR but not 
by H R. WO 00/12716 describes a maize Ku70 orthologue and suggests that 
"Control of homologous recombination or non-homologous end joining by 
modulating Ku provides the means to modulate the efficiency which 
5 heterologous nucleic acids are incorporated into the genomes of a target plant 
cell." WO 00/68404 describes a maize Rad50 orthologue and suggest an 
analogous control for Rad50. Both patent applications do however no t disclose, 
in contrast to the present patent application, that by preventing or more 
preferably completely inhibiting non-homologous recombination, for example 

10 by providing a mutant of a component involved in non-homologous 

recombinationjor^b component, that the integration 

pathway is steered towards homologous recombination. It is clear to a person 
skilled in the art that different mutants of a component involved in non- 
homologous recombination exist. Examples are deletion mutants, knock-out 

15 (for example via insertion) mutants or point mutants. Irrespective of the kind 
of mutant it is important that a component involved in non-homologous 
recombination is no longer capable or at least significantly less capable to 
perform it's function in the process of non-homologous recombination. As 
disclosed herein disruption of YKU70, RAD50, MREll, XRS2, LIG4 and SIR4 

20 did not affect the firequency of DNA integration by HR, showing that these 

genes are not involved in DNA integration by HR, but only in DNA mtegration 
by IR. More over, in the wild-type yeast strain 85% of the integration events 
occurred by HR (37% by replacement and 63% by insertion) and 15% by IR. In 
contrast, integration occurred only by HR in yeast strains lacking ku70 or lig4, 

25 In radSO and xrs2 mutant strains the T-DNA preferentially integrated by HR 
(92%) and 93% of these T-DNAs integrated by replacement and only 7% by 
insertion. Thus, the absence of a functional radSO or ocrs2 gene leads to a 
significantly increased firequency of replacement reactions. 

In another embodiment the invention provides a method to direct 

30 integration of a nucleic acid of interest to a sub-telomeric and^or telomeric 
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region in a eukaryote with a preference for non-homologous recomhination by 
providing a mutant of a component involved in non-homologous recombination. 
A telomeric region is typically defined as a region containing repetitive 
sequences which is located at the end of a chromosome. A sub-telomeric region 
is typically defined as a region flanking the telomeric region. As an example It 
is disclosed herein that in yeast strains carrying disruptions of RAD50, 
MREll or XRS2 the distribution of integrated DNA copies is altered when 
compared to wildt5Tpe. DNA becomes preferentially integrated in telomeres or 
^ubtelomeric regions in the radSO, mrell and ocrs2 mutants. A great advantage 
of integration of DNA copies in telomeres or subtelomeric regions instead of 
integration elsewhere in the genomic material is that there is no danger for 
host genes being mutated or inactivated by a DNA insertion. When in plants 
deficient for RAD50, MREll or XRS2 DNA copies also integrate into telomeres 
or subtelomeric regions, such plants are used for (sub)telomeric targeting of T- 
DNA in transformation experiments to prevent additional insertion mutations 
firom random T-DNA integration into the plant genome. 

In yet another embodiment the invention provides a method to 
direct nucleic acid integration to a pre-determined site, whereby said nucleic 
acid has homology at or around the said pre-determined site, in a eukaryote 
with a preference for non-homologous recombination comprising steering an 
integr^ation pathway towards homologous recombination by partially or more 
preferably completely inhibiting a component involved in non-homologous 
recombination. Partial or complete inhibition of a component involved in non- 
homologo us re combination is obtained by different methods, for example by an 
antibody direpted against such a component or a chemical inhibitor or a 
protein inhibitor or peptide inhibitor or an antisense molecule or an RNAi 
moleciile. Irrespective of the kind of (partial or more preferably complete) 
inhibition it is important that a component involved in non-homologous 
recombination is no longer capable or at least significantly less capable to 
perform it's function in the process of non-homologous recombination. In yet 
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another embodiment the invention provides a method to direct integration of a 
nucleic acid of interest to a sub-telomeric and/or telomeric region in a 
eukaryote with a preference for non-homologous recombination by partially or 
more preferably completely inhibiting a component involved in non- 
5 homologous recombination. Preferably, said component involved in non- 
homologous recombination is rad50, mrell or xrs2. 

In a preferred embodiment the invention provides a method to direct 
nucleic acid integration to a pre-determined site or to a sub-telomeric and/or 
telomeric region by providing a mutant of a component involved in non- 
10 homologous recombination or by partially or more preferably completely 

inhibiting & component involved in non-homologous recombination wherein 
said component comprises ku70, radSO, mrell, xrs2, lig4, sir 4 or others such 
as ku80 (Tacciole et al„ 1994; Milne et al., 1996), lifl (Teo and Jackson, 2000; 
XRCC4 in human, see figure 6; Jtmop et al., 2000) and nejl, (Kegel et al., 
15 2001; Valencia et al., 2001). Components involved in non-homologous 
recombination are identified as outlined in the present description. The 
nomenclatvire for genes as used above is specific for yeast. Because the 
nomenclature of genes differs between organisms a functional equivalent or a 
functional homologue (for example NBSl, a human ocrs2 equivalent (Paull and 
20 Gellert, 1999) and see for example figure 2 to 5) and/or a functional fragment 
thereof, all defined herein as being capable of performing (in function, not in 
amount) at least one function of the yeast genes ku70, radSO, mrell, xrs2y lig4, 
sir 4, ku80, lifl or nejl are also included in the present invention. A mutant of 
a component directly associating with a component involved in non- 
25 homologous recombination or (partial or complete) inhibition of a component 
directly associating with a component involved in non-homologous 
recombination is also part of this invention. Such a component directly 
associating with a component involved in non-homologous recombination is, 
for example, identified in a yeast two hybrid screening. An example of a 
30 component directly associating with a component involved in non-homologous 
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recombination is KU80, which forms a complex with KU70, In a more 
preferred embodiment the invention provides a method to direct nucleic a<nd 
integration in yeast, fungus, plant or (non-human) animal (cells). 

In another embodiment the invention provides a method to direct 
5 nucleic acid integration to a pre-determined site, whereby said nucleic acidTbas 
homology at or aroxmd the said pre-determined site, in a eukaryote with a* 
preference for non-homologous recombination comprising steering an 
integration pathway towards homologous recombination by transiently 
(partially or more preferably completely) inhibiting integration via non- 
10 homologous recombination. In yet another embodiment the invention provides 
a method to direct integration of a nucleic acid of interest to a subtelomeric 
and/or telomeric region in a eukaryote with a preference for non-homologous 
recombination by transiently (psirtially or more preferably completely) 
inhibiting integration via non-homologous recombination. In a more preferred 
15 embodiment, such a method is used for yeast, plant, fungus or (non-human) 
animal and the transient (partial or more preferably complete) inhibition is 
provided by a (preferably stably) inserted and expressed chimeric transgene 
that encodes a peptide inhibitory to one, some or all non-homologous 
recombination (NHR) enzymes fused to a nuclear localisation signal (Hanover, 
20 1992; Raikhel, 1992) and the steroid-binding domain of a steroid receptor 
(Picard et al., 1988). The chimeric transgene is constructed in such a way, 
using either heterologous or non-heterologous promoter sequences and other 
expression signals, that it provides (stable) expression in the target cells or 
tissue for transformation. In the absence of the steroid hormone, the steroid- 
25 binding domain binds to chaperone proteins, and thereby the fusion protein is 
retained in the cytoplasm. Upon treatment with the steroid hormone, the 
chaperones are released from the steroid-binding domain and the inhibitory 
peptide will enter the nucleus where it will interact with and inhibit the action 
of NHR enzymes. An example of an inhibitory peptide is a KU80 fragment that 
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imparts radiosensitivity to Chinese hamster ovary cells (Marangoni et al., 
2000). 

In a more preferred embodiment such a method is used for yeast, 
plant, fungus or a (non-human) animal and the transient (partial or more 
5 preferable complete) inhibition is provided by an Agrobacterium Vir-fusion 
protein capable of (partially or more preferably completely) inhibiting a 
component involved in non-homologous recombination or capable of (partially 
or more preferably completely) inhibiting a functional equivalent or homologue 
thereof or capable of (partially or more preferably completely) inhibiting a 

10 component directly associating with a component involved in non-homologous 
recombination. In an even more preferred embodiment such an Agrobacterium 
Vir fusion protein comprises VirF or VirE2. It was shown that the 
Agrobacterium VirF and VirE2 proteins are directly transferred from 
Agrobacterium to plant cells during plant transformation (Vergunst et al. 

15 2000). To, for example, accomplish T-DNA integration by HR in plants, VirF 
fusion proteins containing for example a peptide inhibitor of IR in plant cells 
are introduced concomitantly with the targeting T-DNA. It has been reported 
that the C-terminal part (approximately 40 amino acids) of VirF or VirE2 is 
sufficient to accomplish transfer of T-DNA. A functional fragment and/or a 

20 functional equivalent of VirF or VirE is therefore also included in the present 
invention. Preferably, said nucleic acid of interest is deUvered to a cell of said 
exikaryote by Agrobacterium. 

In an even more preferred embodiment a component involved in non- 
homologous recombination comprises ku70, radSO, mrell, xrs2, lig4, sir4, 

25 ku80y lifl or tnejl or functional equivalents or homologous thereof or 

associating components. The nomenclature for genes as used above is specific 
for yeast. Because the nomenclature of genes differs between organisms a 
functional equivalent or a functional homologue (see for example figure 2 to 5) 
and/or a functional firagment thereof, all defined herein as being capable of 

30 performing (in function, not in amount) at least one function of the yeast genes 
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ku70, radSO, mrell, xrs2, lig4, sir4, ku80, lifl or nejlaxe also included in the 
present invention. By transiently (partially or more preferably completely) 
inhibiting a component involved in non-homologous recombination a nucleic 
acid is integrated at any desired position without permanently modifying a 

5 component involved in non-homologous recombination and preventing 

xmwanted side effects caused by the permanent presence of such a modified 
component involved in non-homologous recombination. 

Methods according to the present invention, as extensively but not 
limiting discussed above, are used in a wide variety of applications. One 

10 embodiment of the present invention is the replacement of an active gene by 
an inactive gene according to a method of the invention. Complete 
inactivation, which can usually not be accomphshed by existing methods such 
as antisense technology or RNAi technology, is useful for instance for the 
inactivation of genes controlling undesired side branches of metabohc 

15 pathways, for instance to increase the quality of bulk products such as starch, 
or to increase the production of specific secondary metabolites or to inhibit 
formation of unwanted metaboHtes. Also to inactivate genes controlling 
senescence in fi-uits and flowers or that determine flower pigments. Another 
embodiment of the present invention is the replacement of an inactive gene by 

20 an active gene. One example is the replacement of a defect p53 by an intact 

p53. Many tumors acquire a mutation in p53 during their development which . 
renders it inactive and often correlates with a poor response to cancer therapy. 
By replacing the defect p53 by an intact p53, for example via gene therapy, 
conventional anti cancer therapy have better changes of succeeding. In yet 

25 another embodiment of the invention a therapeutic proteinaceous substance is 
integrated via a method of the invention. In this way a tumoricidal gene ise 
delivered to a pre-determined site present only in e.g! proliferating cells, or 
present only in tumor cells, e.g. to the site where a tumor antigen is expressed 
from. In yet another embodiment the invention provides a method to introduce 

30 a substance conferring resistance for an antibiotic substance to a cell according 
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to a method of the invention. Also a method according to the invention is used 
to confer a desired property to an eukaryotic cell. In a preferred embodiment a 
gene delivery vehicle is used to deliver a desired nucleic acid to a pre- 
determined site. Gene deUvery vehicles are well known in the art and include 
5 adenoviral vehicles, retroviral vehicles, non-viral vehicles such as liposomes, 
etc.. In this way, a for example, tumoricidal gene can be delivered to a pre- 
determined site present only in e.g. proliferating cells, or present only in txxmor 
cells, e.g. to the site where a tumor antigen is expressed from. 

Furthermore a method according to the invention is used to improve 

10 gene targeting efficiency. Such a method is used to improve for example the 
gene targeting efficiency in plants. In plants transgenes integrate randomly 
into the genome by IR (Mayerhof et al. 1991), (Gheysen et al. 1991). The 
efficiency of integration by HR is very low, even when large stretches of 
homology between the transgene and the genomic target site are present 

15 (Offringa et al. 1990). Therefore, the efficiency of gene targeting using HR is 
very low in plants. The results that are disclosed herein show how to improve 
the gene targeting efficiency in plants. From the fact that T-DNA integration 
by IR is strongly reduced in KU70, RAD50, MREll, XRS2, LIG4 and SIR4 
deficient yeast strains and T-DNA integration by HR is not affected in such 

20 strains, T-DNA integration by HR is more easily obtained in plants, deficient 
for either of these genes. Recently, we have cloned a KU70 homologue of 
Arabidopsis thaliana (see figure 2, Bundock 2000, unpubUshed data). RAD50, 
MREll and LIG4 homologues have already been found in A^thaliana 
(GenBank accession numbers AF168748, AJ243822 and AF233527, 

25 respectively^ see also figure 3, 4 and 5 (Hartung and Puchta 1999). Currently, 
screenings are being performed to find plants carrying a T-DNA inserted in 
AtMREll, AtKUTO or AtLIG4, These knockout plants are used to test whether 
T-DNA integration by IR is reduced and integration by HR is essentially 
unaffected, thereby facilitating the detection of T-DNA integration by HR. 
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Fxirthermore, the invention provides a method to direct integration of a 
nucleic acid of interest to a pre-determined site, whereby said nucleic acid~has 
homology at or around the said pre-determined site, in a eukaryote with a 
preference for non-homologous recombination, comprising steering an 
5 integration pathway towards homologous recombination, wherein said nucleic 
acid sequence of interest is essentially replacing a sequence within said 
exikaryote. As disclosed herein within the experimental part, in the wild-type 
yeast strain 85% of the integration events occurred by HR (37% by 
replacement ai^d 63% by insertion) and 15% by IR. In contrast, integration 
10 occurred only by HR in yeast strains lacking ku70 or lig4. In radSO and ocrs2 
mutant strains the T-DNA preferentially integrated by HR (92%) and 93% of 
these T-DNAs integrated by replacement and only 7% by insertion. Thus, the 
absence of a functional radSO or ocrs2 gene leads to a significantly increased 
frequency of the desired replacement reactions. 

15 

The invention will be explained in more detail in the following 
description, which is not limiting the invention. 
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EXPERIMENTAL PART 
Yeast strains. 

The yeast strains that were used are hsted in Table 1. Yeast mutants 
5 isogenic to the haploid YPH250 strain were constructed using the one-step 
disruption method (Rothstein 1991). A 1987 bp fragment from the YKU70 
locus was amplified by PGR using the primers hdflpl 5'- 
GGGATTGCTTTAAGGTAG-y and hdflp2 5'-CAAATACCCTACCCTACC-3\ 
The PGR product was cloned into pTTBlue (Novagen) to obtain 

10 pT7Blue YKT770. A 1177 bp EcoRYIHin&lll fragment from the YKU70 ORF 
was replaced by a 2033 bp Hindlll/Smal LEU2 containing fragment from 
pJJ283 (Jones and Prakash 1990), to form TpT7BlueYKU70::LEU2. In order to 
obtain YKU70 disruptants Leu+ colonies were selected after transformation of 
YPH250 with a 2884 bp NdeVSmal fragment from pT7BlMeYKU70::LEU2. 

15 The Expand™ High Fidelity System (Boehringer Mannheim) was used 
according to the supplied protocol to amplify a 3285 bp fragment from the 
LIG4 locus with primex^s 

dnl4pl 5'-CGTAAGATTCGCCGAGTATAG-3' and 

dnl4p2 5'-CGTTTCAAATGGGACCACAGC-3\ The PGR product was cloned 
20 into pGEMT (Promega), resulting in pGEMTL/G4. A 1326 bp BamBI/Xhol 
fragment from pJJ215 (Jones and Prakash 1990) containing the HISS gene 
was inserted into the BamHl and ^lol sites of pIC20R, resulting in 
VIG20RHIS3. A 782 bp EcoRl fragment from the LIG4 ORF was replaced with 
a 1367 bp EcoRI HISS containing fragment from pIC20RiJISo? to construct 
25 pGEMTLIG4::HIS3. In order to obtain LIG4 disruptants His+ colonies were 
selected after transformation of YPH250 with a 3854 bp Ncol/Notl fragment 
from pGm/LTLIG4::HIS3, In order to obtain RAD50 disruptants YPH250 was 
transformed with a JEcoRI/JB^ZII fragment from pNKY83 and Ura+ colonies 
were selected (Alani et al 1989). A rad50::hisG strain was obtained by 
30 selecting Ura* colonies on selective medium containing 5-FOA. Similarly 
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RAD51 disruptants were obtained were obtained after transformation of 
YPH250 with a RAD51::LEU2 Xbal/Pstl fragment from pDG152 and selection 
of Leu+ colonies (Schiestl et al 1994). The TRPl marker in pSM21 (SchUd et al, 
1983) was replaced with a Bglll/Xbal LEU2 containing fragment fr-om p JJ283 
5 (Jones and Prakash, 1990). This resulted in ^SM21LEU2. Leu+ RAD52 

disruptant colonies were selected after transformation of YPH250 with the 
RAD52::LEU2 BamHI fragment from pSM21LEU2. Disruption constructs 
were transformed to YPH250 by the hthium acetate transformation method as 
described (Gietz et al. 1992; Schiestl et al. 1993). Disruption of YKU70, LIG4, 
10 RAD50, RAD51 and BAD52 was confirmed by PGR gmd Southern blot 
analysis. 
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Table 1: Yeast strains 



Strain 


Genotype 


Reference 


YPH250 


MATa, ura3-52, lys2-801, ade2-101, 


(Sikorski and 




trpl-Al, his3-A200, Ieu2-Al 


Hieter 1989) 


YPK250radSl 


MATa, ura3-S2, ly82-801, ade2-101. 


This study 




trpl-Al, his3-A200, Ieu2-Al, 






rad.51::LEU2 




Yl?H.250rad52 


MATa, ura3-52, lys2-801, ade2-101, 


This study 




trpl-Al, his3-A200, Ieu2-Al, 






rad52::LEU2 




YPB.250yku70 


MATa, ura3-52, lys2-801, ade2-101, 


This study 




trpl-Al, his3-A200, Ieu2-Al. 






yku70::LEU2 




YPH250rad50 


MATa, ura3-52, lys2-801, ade2-101, 


This study 




trpl-Al, his3-A200, leu2-Al,rad50::hisG 




YFli250lig4 


MATa, ura3-52, lys2-801, ade2-101, 


This study 




trpl-Al, his3-A200, Ieu2-Al, lig4::HIS3 




JKM115 


Alio, Ahml::ADEl, MATa, Ahmr::ADEl, 


(Moore and 




adel, leu2-3,112, lys5, trpl::hisG, 


Haber 1996) 




ura3-52 




JKM129 


Aho, Ahml::ADEl, MATa, ^mr::ADEl, 


(Moore and 




adel, leu2-3,112, lys5, trpl::hisG, 


Haber 1996) 




ura3-52, xrs2::LEU2 




dx\JLVX±<3o 


AtlO, AriJ7lL,,/±UJl/ly lVJu\X a, Ari7TlT*,jnJLJllt 1, 


(Moore and 




adel, leu2'3,112, lysS, trpl::hisG, 


Haber 1996) 




ura3-52, mrell::hisG 




YSL204 


Aho, HMLa, MATa, HMRa, adel-lOO, 


(Lee et al. 




leu2-3,112, lysS, trpl::hisG, ura3'52. 


1999) 




hisG'-URA3-hisG\ sir4::HIS3 
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Construction of binary vectors. 

To construct pSDMSOGG a 1513 bp Pvull/EcoRY fragment carrying the 
KanMXmsxkBT was obtained from pFA6a (Wach et al. 1994) and was ligated 
into the unique Hpal site of pSDMl4 (Offringa 1992). pSDMSOOl was made in 
5 three cloning steps. A 1476 bp BaniHI/EcoBI fragment carrying the KanMX - 
marker was obtained from pFA6a and Ugated into JSaTnHI and EcoBI digested 
pIC20H to form pIC20HfeanMX. The KanMX marker was inserted between the 
PDAl flanks by replacement of a 2610 bp JB^ZII fragment from pUC4ElalO 
(Steensma et al. 1990) with a 1465 S^ZII fragment from pIC20HifeanMX. A 
10 3721 bp Xhol/Kpnl fragment from this construct was inserted into the Xhol 
and Kpnl sites of pSDM14. The binary vectors pSDMSOOO and pSDMSOOl 
were introduced into Agrobacterium tumefaciens LBA1119 by electroporation 
(den Dulk-Ras and Hooykaas 1995). 

15 Cocultivations / T-DNA transfer experiments. 

Cocultivations were performed as described earlier with slight 
modifications (Bundock et al. 1995). Agrobacterium was grown overnight in LC 
medixmi. The mix of Agrobacterium and S. cerevisiae cells was incubated for 9 
days at 20°C. G418 resistant S.cerevisiae strains were selected at 30°C on 
20 YPAD medixmi containing geneticin (200 ^ig/ml) (Life Technologies/Gibco 
BRL). 

Vectorette PGR. 

Chromosomal DNA was isolated using Qiagen's Genomic Tips G/20 per 

25 manufacters protocol. 1-2 jig of Genomic DNA was digested with ^coRI, Clal, 
Pstl or Hijidlll and run on a 1% TBE-gel. Non-radioactive Southern blotting 
was performed. The membrane was hybridized with a digoxigenine-labeled 
kanMX probe to determine the size of T-DNA/genomic DNA fragments (EcoRl 
and Clal for RB containing fr-agments and Pstl and Hindlll for LB containing 

30 fragments). The kanMX probe, a 792 bp internal fragment of the KanMX 
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marker, was made by PGR using primers kanmxpl 5'- 
AGACTCACGTTTCGAGGCC-3' and kanmxp2 5'- 

TCACCGAGGCAGTTCCATAG-3' and a Non-Radioactive DNA Labeling and 
Detection kit (Boehringer Mannheim). The enzjnne showing the smallest band 
5 on blot was used for Vectorette PGR, in order to amplify the smallest junction 
sequence of T-DNA and genomic DNA. Vectorette PGR was performed as 
described 

(hti5).7/genomewww.stanford.edu/group/botlab/protocols/vectorette.html). The 
Expand™ High Fidelity System (Boehringer Mannheim) was used to amplify 
10 fragments larger than 2.5 kb, whereas sTaq DNA polymerase (SphaeroQ) was 
used for amplification of fi-agments smaller than 2.5 kb. Primer kanmxp3 5'- 
TCGCAGGTCTGGAGCGAGGAGC-3' and kanmxp4 5'- 

TGGCCTCGACATCATGTGCCCAG-3' were vised to amplify RB/genomic DNA 
and LB/genomic DNA junction sequences, respectively. 

15 

T7 DNA Polymerase sequencing. 

Vectorette PGR products were cloned in pGEMTEasy (Promega) and 
sequenced using the T7 polymerase sequencing kit (Pharmacia) according to 
manxifacturers protocol. In order to obtain sequences flanking the RB and LB, 
20 primers kanmxp5 5'-TGAGATGATGGGGGTGAGGTGC-3' and kanmxp4 were 
used, respectively. 
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RESULTS 

1. Binary vectors for T-DNA transfer to yeast. 

It was previously demonstrated that Agrobacterium tumefaciens is able 
5 to transfer its T-DNA not only to plants but also to another eukaryote, namely 
the yeast Saccharomyces cerevisiae (Bundock et al. 1995). T-DNA carrsdng- 
homology with the yeast genome was shown to become integrated by 
homologous recombination. T-DNA lacking any homology with the yeast 
genome was integrated randomly into the genome by IR like in plants 

10 (Bundock et al. 1995, (Bundock and Hooykaas 1996). The T-DNA used in these 
experiments carried the S.cerevisiae URA3 gene for selection of Ura+ colonies 
after T-DNA transfer to the haploid yeast strain RSY12(C7JJ:A5z1). However, in 
this system only yeast strains could be used in which the URA3 gene had been 
deleted to avoid homology between the incoming T-DNA and the S, cerevisiae 

15 genome. 

We wanted to setup a system in which T-DNA transfer to any yeast 
strain could be studied. Therefore, two new binary vectors were constructed 
using the dominant marker fea/tMX (Wach et al 1994)which confers resistance 
against geneticin (G418). The T-DNA of pSDMSOOO carries only the KanMX 

20 marker. Since this KanMX marker consists of heterologous DNA, lacking any 
homology with the S. cerevisiae genome, we would expect this T-DNA to 
integrate by IR at a random position in the yeast genome. To be able to 
compare this with T-DNA integration by homologous recombination 
pSDMSOOl was constructed. The T-DNA of pSDMSOOl carries the KanMX 

25 marker flanked by sequences from the S.cerevisicce PDAl locus. The PDAl 

sequences have been shown to mediate the integration of T-DNA by HR at the 
PDAl locus on chromosome V (Bundock et al. 1995). 

Cocultivations between Agrobacterium. strains carrying pSDMSOOO and 
pSDMSOOl, respectively, and the haploid yeast strains YPH250 and JKM115, 

30 respectively, were carried out as described in the experimental part. G418 
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resistant colonies were obtained at low frequencies for YPH250 (1.6 x lO 'O and 
JKM115 (1.2 X 10-5) after T-DNA transfer from pSDMSOOO (Table 2). T-DNA 
transfer from pSDMSOOl generated G418 resistant colonies at higher 
frequencies (2.4 x lO-^ for YPH250 and 1.8 x 10-4 JKM115, Table 2). The ratio 
5 of homologous recombination versus illegitimate recombination is determined 
by comparing the frequencies of G418 resistant colonies obtained from 
cocultivations using either pSDMSOOl or pSDMSOOO. This showed that a T- 
DNA from pSDMSOOl was 150-fold more likely to integrate than a T-DNA 
from pSDMSOOO in YPH250 (Table 2). A similar difference was previously seen 

10 using T-DNAs with the URA3 marker (Bundock and Hooykaas 1996). In 
contrast, T-DNA from pSDMSOOl was only 16-fold more likely to integrate 
than a T-DNA fr-om pSDMSOOO in JKM115. There was no significant 
difference in the frequency of T-DNA transfer to these two yeast strains as was 
determined by T-DNA transfer experiments in which a T-DNA, that carried 

15 the ITanMX marker and the yeast 2 micron replicon, was employed. Therefore, 
the differences in the frequencies of T-DNA integration by HR and IR between 
the yeast strains YPH250 and JKM115, respectively, is most hkely contributed 
to differences in the capacities of their HR and IR recombination machineries. 
We confirmed by PGR that the T-DNA from pSDMSOOl became 

20 integrated at the PDAl locus by homologous recombination (data not shown). 
In order to find out whether the T-DNA from pSDMSOOO had integrated 
randomly by IR yeast target sites for integration were determined from 8 G418 
resistant YPH250 colonies by Vectorette PGR (for detailed description see 
materials and methods). Chromosomal DNA was isolated and digested with a 

25 restriction enzyme that cuts within the T-DNA. A Vectorette was ligated to the 
digested DNA and a PGR was performed using a T-DNA specific and a 
Vectorette specific primer. The PGR product obtained was cloned into 
pGEMTEasy and sequenced using a T-DNA specific primer. The position of the 
T-DNA insertion was determined by basic BLAST search of the yeast genome 

30 (http://www-genome.stanford.edu/SGD). We were thus able to map the position 
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of the T-DNA insertions of all 8 G418 resistant colonies analyzed. They were 
present at different positions spread out over the genome. Comparison of the 
T-DNA sequence and yeast target site sequences did not reveal any ohvious 
homology. These data show that the T-DNA from pSDMSOOO had integrated 
5 via an IR mechanism as expected. 

The following characteristics have previously been observed for T-DNAs 
integrated by IR: a) the 3' end of the T-DNA is usually less conserved 
compared to the 5' end, b) microhomolpgy is sometimes present between the T- 
DNA ends and the target site, c) integration is often accompanied by small 

10 deletions of the target site DNA (Matsumotot et gJ. 1990), (Gheysen et al. 
1991, (Mayerhofer et al. 1991), (Bundock and Hooykaas 1996). Similar 
characteristics were seen in the currently analyzed 8 T-DNA insertions. In 3 
strains we observed microhomology of 2 — 6 bp between the LB and yeast 
target site (figiire 1, WT.51 was taken as an example). In 5 strains deletions of 

15 1 — 5 bp of yeast target site DNA was found and we observed deletions, varying 
from 1 — 112 bp, of the 3' end of the T-DNA in 7 out of 8 analyzed strains. In 
only 1 strain the 3' end appeared to be intact. The 5' end of the T-DNA was 
conserved in almost all strains. In only 2 strains we observed small deletions 
of 1 and 2 bp at the 5' end of the T-DNA. 

20 Thus, we can conclude that the T-DNA from pSDM8000 had integrated ^, 

via the same IR mechanism described before. 



I 
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Table 2: Frequencies of T-DNA integration by IR relative to integration by HR 
in recombination defective yeast strains 



Strain 


Grenotype 


Freq. of 


Preq. of 


Absolute 


Standardized 








HR 


IR/HR 


IR/HR ratio« 










ratio'' 




YPH250 


WT 


1.6 X 10-7 


2.4 X 10-5 


0.007 


1 


YPH250 


radSlA 


1.4 X 10-7 


1.5 X 10-6 


0.09 


14 


rad51 












YPH250 


rad52A 


3.8 X 10-7 


2.5 X 10-6 


0.15 


23 


rad52 












YPH250 


, yku70A 


<3.2 X 10-9 


3.3 X 10-5 


<0.0001 


<0.01 


Ir IV €/€/ # \^ 












YPH250 


radSOA 


8.0 X 10-3 


3 5 X 10-5 


0.0002 


0.03 


radSO 












YPH250 


lig4A 


3.7 X 10-9 


2.3 X 10-5 


0.0002 


0.02 


lig4 












JKM115 


WT 


1.2 X 10-5 


1.8 X 10-4 


0.07 


1 


JKM129 


xrs2A 


2.7 X 10-7 


5.1 X 10-5 


0.005 


0.08 


JKM138 


mrellA 


2.9 X 10-7 


7.5 X 10-5 


0.004 


0.06 


YSL204 


sir4A 


1.5 X 10-7 


1.8 X 10-5 


0.008 


0.13 



a Averages of 2 or more independant experiments are shown. Frequencies are 
5 depicted as the number of G418 resistant colonies devided by the output 
number of yeast cells (cells/ml). 

b The frequency of T-DNA integration by IR (pSDMSOOO) devided by the 
frequency of T-DNA integration by HR (pSDMSOOl). 

^ The ratio of IR/HR in the mutant strain devided by the ratio of IR/HR in the 
10 wildtjrpe strain. 
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2, Host-specific factors involved in random T-DNA integration. 

The observation that the T-DNA from pSDMSGGO integrates by IR into 
the yeast genome allowed us to use this system to study the effect of host 
factors on the process of integration. Many proteins involved in various forms 
5 of DNA recombination have been identified in yeast. In order to determineT;he 
roles of a representative set of these enzymes in T-DNA integration, we - 
compared T-DNA transfer and integration in wildtype yeasts with that of 
strains cairying disruptions of the genes encoding several recombination 
proteins. The RAD51, RAD52, KU70, RAD50 and LIG4 genes were deleted 

10 from YPH250 using the one step disruption method. Yeast strains canying 
deletions in MREll, XRS2 and SIR4 in the JKM115 background were kindly 
provided by Dr. J. Haber. The results of cocultivations with these yeast strains 
are shown in Table 2. 

In radSl and rad52 mutants, which are impaired in homologous 

15 recombination, the frequency of T-DNA integration by HR was 16- and 9-fold 
lower, respectively, than observed for the wildtype (Table 2). This showed that 
RAD51 and RAD52 play a role in T-DNA integration by homologous 
recombination. In the IR defective ku70, radSO, lig4, mrell, xrs2 and sir 4 
mutants the frequency of T-DNA integration by HR did not differ significantly 

20 from that observed for wildtype (Table 2). This showed that these genes do not 
play a role in T-DNA integration by homologous recombination. 

The frequency of T-DNA integration by IR in a rodSl mutant did not 
differ significantly fi'om that observed for wildtype, whereas in a rad52 mutant 
the frequency was about 2-fold higher (Table 2). This showed that BAD 51 and 

25 RAD52 are not involved in T-DNA integration by IR. The product of the 
RAD52 gene may compete with IR-enzymes for the T-DNA and thereby 
inhibits integration by IR to some extent. Strikingly, in radSO, mrell, xrs2, 
lig4 and sir4 mutants the frequency of T-DNA integration by IR was reduced 
dramatically: 20- to more than 40-fold (Table 2). T-DNA integration by IR 

30 seemed to be completely aboUshed in the ku70 mutant. We did not obtain any 
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G418 resistant colonies from several cocxiltivation experiments. This strongly 
suggests that KU70 plays an important role in random T-DNA integration in 

yeast. 

Since T-DNA integration by HR is normal in these mutants, these 
5 results clearly show that the yeast genes KU70, RAD50, MREll, XRS2, LICr4 
and SIR4 are involved in T-DNA integration by illegitimate recombination. 

3. Chromosomal distribution of integrated T-DNA copies in IR 
defective S.cerevisiae. 

10 From several cocultivation experiments with the radSO, mrell, xrs2, 

lig4 and sir4 mutants we obtained a small nimiber of G418 resistant colonies. 
The T-DNA structure was determined for a number of these lines. To this end 
chromosomal DNA was isolated from these G418 resistant colonies and 
subjected to vectorette PGR to amplify junction sequences of genomic DNA and 

15 T-DNA. PGR products were cloned and sequenced. The yeast sequences linked 
to the T-DNA were used in a BLAST search at http://www- 
genome.stanford.edu/SGD to determine the T-DNA integration sites. 

Strikingly, analysis of LB/genomic DNA junctions revealed that in 2 out 
of 3 radSO, 4 out of 6 mrell and 2 xrs2 strains analyzed, T-DNAs had 

20 integrated in telomeres or subtelomeric regions {rodSOk.l, radSOk.S, mrellk.8, 
mrellk.ll, mrellk.14, mrellk.17, xrs2k,l and xrs2k.l7; Table 3 and figure 1). 
S. cerevisiae telomeres generally consist of one or more copies of the Y element 
followed by telomerase-generated G(l-3)A/TG(l-3) repeats (Zakian 1996). In 2 
radSO strains, 2 mrell strains arid 1 xrs2 strain the LB was foimd to be fused 

25 to this typical telomerase-generated C(l*3)A/TG(l-3) repeat {rad50k,l, 

radSOk.6, mrellk.l4, mrellk.17 sx\d xrs2kA] figure 1). Besides, we also found 
one T-DNA insertion in a Ty LTR element in the mrell mutant and 2 
insertions in the rDNA region, present in chromosome XII, in the mrell and 
radSO mutants (mrellk.S, mrellk.4 and radSOk.S, respectively; Table 3 and 

30 figure 1). 



wo 02/052026 



PCT/NLOl/00936 



26 

The 3' end of the T-DNA was trxmcated in all strains. Deletions of 3 - 11 
bp of the 3'end of the T-DNA were observed (figure 1). Microhomology between 
the 3' end of the T-DNA and yeast target site was only found in 2 lines (5 bp in 
mrellk.4 and 4 bp in mrellk.14; figure 1). For the T-DNA copies present at 
5 the yeast telomeres, the RB/genomic DNA junction sequences could not be " 
obtained from these strains using vectorette PGR. This was only possible for 
the rad50 and mrell strains carrying the T-DNA in the rDNA region on 
chromosome XII. In both strains the RB was intact and no homology between 
the 5' end of the T-DNA and the yeast target site was fo\md (data not shown in 
10 figure 1). 

Previously, target sites for T-DNA integration in the genome of 
S.cerevisiae strain RSY12 were determined (Bimdock and Hooykaas 1996), 
(Bundock 1999. In 4 out of 44 strains analyzed, T-DNA copies were integrated 
in rDNA, Ty LTR elements (in 2 strains) and a subtelomeric located T 

15 element, respectively. In addition, we determined the position of T-DNA 

integration in ten S.cerevisiae YPH250 strains. We did not find any T-DNA 
insertions in rDNA, LTR elements or subtelomeric/telomeric regions amongst 
these ten. Pooling all insertions analyzed in wildtjrpe (54), in 2 out of 54 
strains analyzed (4%) insertions were found in a Ty LTR element and in two 

20 other strains in the rDNA repeat (2%) and a subtelomeric region (2%), 

respectively. In contrast, we report here that T-DNA in yeast strains mutated 
in RAD50, MREll or XRS2 T-DNA integrates preferentially in (sub)telomeric 
regions (8 out of 11 lines: ~73%) of radSO, mrell and xrs2 mutants (table 3). 
From the remaining strains two T-DNAs were present in rDNA and one in a 

25 Ty LTR eleipent, respectively. Apparently, the rDNA repeat is also a preferred 
integration site in these mutants (^18% vs. ^2% in the wildtsrpe). 

Telomeres consist of a large array of telomerase-generated C(l- 
3)A/TG(l-3) repeats (-^350 bp). In the subtelomeric regions two common classes 
of Y* elements, 6.7 and 5.2 kb, can be foxmd (in most strains chromosome I does 

30 not contain Y) (Zakian and Blanton 1988), making the average size of these 
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regions --'6,0 kb. Thus, the yeast genome contains (16 x 2 x 0,35) -f (15 x 2 x 
6,0) = 191 kb of subtelomeric/telomeric sequences. The yeast genome is 12,052 
kb in size, which means that only 1.6% of the genome consists of 
subtelomeric/telomeric sequences. In accordance with this, we observed in only 
5 2% of the wildtjrpe strains T-DNA copies inserted in a subtelomeric region, 
which we wotdd expect on the basis of random T-DNA integration. In contrast, 
in the radSO, mrell and xrs2 mutants 73% of the T-DNA insertions were found 
in the (sub)telomeric region. 

Analysis of 7 lines revealed that in the sir4 mutant T-DNA was 

10 integrated randomly into the yeast genome. So, although SIR4 has an effect on 
the efficiency of T-DNA integration by IR, the pattern of T-DNA distribution in 
the transformants seems similar as in the wildtype strain. In the sir4 mutant 
T-DNA integration by IR was characterized by truncation of the 3' end of the 
T-DNA, deletions at the target site and microhomology between the LB and 

15 the target site (data not shown), like this was observed for T-DNA integration 
by IR in the wildtype. 

These results clearly show that in the rad50, mrell and xrs2 mutants 
the T-DNA, if integrated at all, becomes preferentially inserted in telomeres or 
subtelomeric regions and that the genomic distribution of integrated T-DNAs 

20 is altered when compared to wildtype. However, disruption of SIR4 did affect 
the efficiency of T-DNA integration by IR, but not the characteristics of this 
process. 
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Table 3: genomic distribution of T-DNA integrated by IR in radSO, mrell and 
xrs2 mutants in comparison with the wildtype after T-DNA transfer from " 
pSDMSOOO 



Yeast strain 


(Sub)Telomeric 
region 


LTR 


rDNA 


Elsewhere 


radSO mutant 


2 


0 


1 


0 


mrell mutant 


4 


1 


1 


0 


3(rs2 mut£uit 


2 


0 


0 


0 


wildtype 


1 


2 


1 


50 



I 



wo 02/052026 



PCT/NLOl/00936 



29 

DESCRIPTION OF FIGURES 

Figvire 1: Junction sequences of T-DNA and S.cerevisae genomic DNA. 
S.cerevisiae YPH250 (WT), radSO, mrell and xrs2 strains were cocultivated 
5 with LBAl 1 19(pSDM8000). G418 resistant colonies were obtained. - 
Chromosomal DNA was isolated and subjected to Vectorette PGR to determine 
the sequence of genomic DNA flanking the T-DNA. Position of T-DNA 
integration was determined by basic BLAST search of the yeast genome at 
http:/www. genome-stanford.edu/SGD. The Watson strand of genomic DNA 
10 that is fused to the LB or RB is shown in italics. Bold sequences represent 

sequence homology between the LB and target site. The filler DNA sequence is 
underlined and depicted in italics. The numbers above the LB sequences 
represents the number of bp deleted firom the LB. Tel. = telomeric, Subtel. = 
subtelomeric and Int. = intergenic. 

15 

Figure 2: Ahgnment of KU70 homologues. Sc = Saccharomyces 
cerevisiae, Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity 
is depicted as black boxes, homology is depicted as grey boxes and dashes were 
used to optimise alignment. 

20 

Figure 3: Alignment of LIG4 homologues. Sc = Saccharomyces cerevisiae, 
Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity is depicted 
as black boxes, homology is depicted as grey boxes and dashes were used to 
optimise alignment. 
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Figure 4: Alignment of MREll homologues. Sc = Saccharomyces 
cerevisiae, Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity 
is depicted as black boxes, homology is depicted as grey boxes and dashes were 
used to optimise alignment. 

Figure 5: Alignment of RAD50 homologues, Sc = Saccharomyces 
cerevisiae, Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity 
is depicted as black boxes, homology is depicted as grey boxes and dashes were 
used to optimise alignment. 

Figure 6: Alignment of XRCC4 homologues. Sc = Saccharomyces 
cerevisiae, Hs = Homo sapiens and At = Arabidopsis thaliana. 
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Claims 

1. A method to direct integration of a nucleic acid of interest to a pre- 
determined site, whereby said nucleic acid has homology at or around the said 

5 pre-determined site, in a exikaryote with a preference for non-homologous 
recombination, comprising steering an integration pathway towards 
homologous recombination. 

2. A method to direct nucleic acid integration according to claim 1, 
comprising providing a mutant of a component involved in non-homologous 

10 recombination. 

3. A method to direct nucleic acid integration according to claim 1 or 2, 
comprising inhibiting a component involved in non-homologous recombination. 

4. A method according to claim 2 or 3 wherein said component involved 
in non-homologous recombination comprises kuTO, radSOy mrell, xrs2, lig4 or 

15 sir 4. 

5. A method to direct integration of a nucleic acid of interest to a pre- 
determined site according to anyone of claims 1 to 3, wherein said nucleic acid 
sequence of interest is essentiallj'^ replacing a sequence within said eukaryote, 

6. A method to direct integration of a nucleic acid of interest to a pre- 
20 determined site according to claim 5, wherein said component involved in 

non-homolgous recombination comprises radSO or ocrs2. 

7. A method to direct integration of a nucleic acid of interest to a sub- 
telomeric and/or telomeric region in a eukaryote with a preference for non- 
homologous recombination by providing a mutant of a component involved in 

25 non-homologous recombination, 

8. A method to direct integration of a nucleic acid of interest to a sub- 
telomeric and/or telomeric region in a eukaryote with a preference for non- 
homologous recombination, comprising inhibiting a component involved in 
non-homologous recombination 
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9. A method to direct integration according to claim 7 or 8 wherein said 
component involved in non-homologous recombination comprises radSO, mrell 
or ocrs2, 

10. A method according to anyone of claims 1 to 9 wherein said 
5 eukaryote comprises yeast, a fungus or an animal. 

11. A method according to anyone of claims 1 to 10, wherein said nucleic 
acid of interest is delivered to a cell of said eukaryote by Agrobacterium. 

12. A method according to anyone of claims 1-11 comprising transiently 
inhibiting integration via non-homologous recombination. 

10 13. A method according claim 12 wherein said transiently inhibiting is 

provided by an Agrobacterium Vir-fusion protein capable of inhibiting a 
component involved in non-homologous recombination. 

14. A method to direct nucleic acid integration according to claim 13 
wherein said Agrobacterium Vir fusion protein comprises VirF or VirE2. 

15 14. A method according to claim 13 or 14 wherein said component 

involved in non-homologous recombination comprises ku70, radSO, mrell, 
xrs2, lig4 or sir 4. 

15. A method according to anyone of the aforegoing claims wherein said 
nucleic acid of interest comprises an inactive gene to replace an active gene. 

20 16. A method according to anyone of claims 1-14, wherein said nucleic 

acid of interest comprises an active gene to replace an inactive gene. 

17. A method according to anyone of claims 1-14, wherein said nucleic 
acid of interest encodes a therapeutic proteinaceous substance. 

18. A method according to anyone of claims 1-14, wherein said nucleic 
25 acid of interest encodes a substance conferring resistance for an antibiotic 

substance to a cell. 

19. A method according to anyone of claims 1-14, wherein said nucleic 
acid of interest confers a desired property to said eukaryotic cell. 

20. A method according to anyone of the aforegoing claims wherein said 
30 nucleic acid of interest is part of a gene delivery vehicle. 
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21. Use of a method according to anyone of claims 1 to 20 for 

improvement of gene targeting efficiency. 



< 



I 
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Fig. 1 



LB' T-DNA RB' 
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-6 
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XV, 1091277, tel. region 
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' GGGGGCarCAGTATTCAATTGTAAAT-CTC CGA-GG 3' 

-25 

' GAGGrAGArGrGAGAGAGTGTGrGrGGGTGTCAAGTCGA 3' 


Xn. 465986, rDNA region 


rad50k.6 
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XV, 1091276, tel. region 
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TCrGGrAGATATATTCAATTGTAAAT-CTC CGA-GG 3' 


Xn. 459692/468829, rDNA region 
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-8 
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-10 

rGrGTGGGrGTGGGTCAATTGTAAAT-CTC CGA-GG 3 ' 
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-1 

' CGTCAAGGATATATTCAATTGTAAAT-CTC CGA-GG 3' 


xn, 1071797, subtel. region 
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Fig. 3 
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Fig. 3, Contd 
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Fig. 4 
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Fig. 5 
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Fig. 6 
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